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SYSTEMS 
BRANCH 



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Application Serial Number 
Source: 



Date Processed by STIC: ^/2.(2j>0/ 




THE ATTACHED PRINTOUT EXPLAmS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO TOE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF TOIS PRINTOUT IN YOUR NEXT COMMUNICATION TO TOE 
APPLICANT, WITO A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF TOIS PRINTOUT, WITO A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703.308-4216. 
PATENTIN 2.1 e-mail hdp: Datin21helD(g).usDto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: Datin3help(g).usDto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS. PLEASE USE THE CHECKER 
VERSTON 3.0 PROGRAM. ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interfece to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 



Checker Version 3.0 can be down loaded from th«^ USPTO website at the following address; 

http://www.uspto.gov/web/offices/pac/checker 
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RAW SEQUENCE LISTING DATE: 08/02/2001 (J^^ 

PATENT APPLICATION: US/09/909,566 TIME: 11:03:33 

Input set : A:\BB1465 US NA Seq Listing.txt DoeS Nct Comply 

Output set: N:\cRF3\o8022ooi\i909566.raw Corrected Diskette Needed 

3 <110> APPLICANT: Gaboon, Edgar B ^ ^.u • * 

5 <120> TITLE OF INVENTION: A Cytochrome P450 enzyme associated with the synthesis ot 

12-epoxy 

6 fatty acids 

8 <130> FILE REFERENCE: BB1465 US NA 

10 <140> CURKENT APPLICATION NUMBER: US/09/909,566 

11 <141> CURRENT FILING DATE: 2001-07-20 

13 <150> PRIOR APPLICATION NimS£Ri..J0/219833 - , / ^ iJ ' //l^/ 

W-> 14 <151> PRIOR FILING DATE:< gIly 21, 2000 ^ 300^-0?- J / .XJ^ ^^^^ ^d^'^^ 

16 <160> NUMBER OF SEQ ID N0ST~7 
IB <170> SOFTWARE: Microsoft Office 97 

20 <210> SEQ ID NO: 1 

21 <211> LENGTH: 1733 

22 <212> TYPE: DNA 

23 <213> ORGANISM: Euphorbia lagascae 

25 <400> SEQUENCE: 1 

26 gcataaaagg aaaatggagc agaaaaatct ctcttttccg agcattttaa taagttttct 60 

27 gcttgtttta atcttagtag tagtcatgag gttgtggaag aaacagaatc cacctccagg 120 

28 gccatggaag tttcctatca taggtaatct tcctcattta ttactcactt ctgatctagg 180 

29 ccatgaacgt tttagagcct tggctcaaat ttatggacct gttatgagtc ttcaaattgg 240 

30 ccaagtttca gctgttgtca tttcttcagc tgaagcagcc aaagaggtta tgaaaactca 300 

31 ggctgatgcc ttcgcccaac gccctatcgt cttggacgca cagattgtgt tttataatcg 360 

32 gaaagatgtc ttgtttgctt catatggaga tcactggagg cagatgaaga aaatttggat 420 

33 acttgaattt ctgagtgcca aaaaagttca atcctccagg ttaatccgag aggaagaaat 480 

34 ggaggatgcc atcacattcc tccgttcgaa agccggatct ccggtcaata ttacaaagat 540 

35 catttatggc attataattt cgatcatgat aagaacatcc gttggtaatt gtaagcaaaa 600 

36 agaaagattg ctgagtgttg ccgatgcagt caatgaggca gcgacgagtt ttggcaccgc 660 

37 agacgctttt ccgacgtgga aattacttca ctatatcatt ggagctgagt caaaacccag 720 

38 gcgtttgcat caggagattg acgatatact tgaagagatt cttaatgaac acaaagccaa 780 

39 taagcctttt gaagcggata acttaatgga tgttctattg aatcttcaaa aaaatggaaa 840 

40 cgttccagtg ccagtgacaa acgaaagcat caaagcatcc gttttgcaaa tgtttactgc 900 

41 cgggagcgaa acaacttcga aagctacaga atgggtaatg gcagagctga tgaaaaatcc 960 

42 aactgaacta agaaaagcac aagaagaagt tagacaagta tttggtgaaa tgggaaaagt 1020 

43 tgatgaatca agatttcatg atttgaaatt cttcaagtta gtggttaaag aaactctaag 1080 

44 attacatcct ccggttgtct tgattccgag ggagtgtaga gaaacaacac gaattgatgg 1140 

45 atatgaaatt catccgaaca ctcgaattgt tgtgaatgct tgggcgatag gaagagatcc 1200 

46 taatacttgg tcggaacctg gaaagtttaa cccagaaagg tttaaagatt gtgcaattga 1260 

47 ttataaaggg acgacatttg aactggtacc atttggtgca ggaaaaagaa tatgtcctgg 1320 

48 cattacttca gctattacca atttggagta tgtcattata aatctattat atcattttaa 1380 

49 ttgggaactg gccgatggaa ttacacctca aacacttgat atgactgaag ctattggcgg 1440 

50 tgctctcagg aaaaaaatag atcttaagtt gattcctatt ccatatcaag ttagcttagg 1500 

51 ctcaaatatt tcttgattac ataggagggt tgaaatatat ataataaact ttaattaacg 1560 

52 atgttctaat atggtttggg tgagttataa taggttttcc accgatcata taagtagcct 1620 

53 tctttgatgg atgggttaga ttataatgag ttgtgggttg gatttttaga tgggttaaat 1680 

54 gatttggatg gataataata aattgaaatg ttttcttttt caaatccgaa aaa 1733 

57 <210> SEQ ID NO: 2 

58 <211> LENGTH: 500 

59 <212> TYPE: PRT 



file://C:\CRF3\Outhold\VsrI909566.htm 



8/2/01 



Page 2 of 7 



RAW SEQUENCE LISTING I^ATE: 08/02/2001 

PATENT APPLICATION: US/09/909,566 TIME: 11:03:33 

Input Set : A:\BB1465 US NA Seq Listing.txt 
Output Set: N:\CRF3\08022001\I909566 , raw 

60 <213> ORGANISM: Euphorbia lagascae 

) Ser He Leu He Ser I 

4 ■■' 

Leu Val Leu He Leu Val Val Val Met Arg Leu xrp i.ys j 

-ir* 9S 30 

67 

Pro Pro Pro Gly Pro Trp Lys Phe Pro He lie uiy i 

-.tr An 45 

70 
72 
73 

^-.^ m,,^ ni.r Dr-^ vi^t Ser Leu Gin lie Giy u±n vax c>t:x t 

80 

76 65 

w^i c^r- Cor- Ala mil Ala Ala LYS UiU vax rieu uyi. - 

95 

79 



Glu 


Gin 


Lys 


Asn 


Leu 


Ser 


Phe 


Pro 


Ser 


He 


Leu 






5 










10 






Val 


Leu 


He 


Leu 


Val 


Val 


Val 


Met 


Arg 


Leu 


Trp 






20 










25 






Gly 


Pro 


Pro 


Gly 


Pro 


Trp 


Lys 


Phe 


Pro 


He 


He 




35 










40 








Phe 


Leu 


Leu 


Thr 


Ser 


Asp 


Leu 


Gly 


His 


Glu 


Arg 


50 










55 










60 


He 


Tyr 


Gly 


Pro 


Val 


Met 


Ser 


Leu 


Gin 


He 


Gly 






70 










75 




Val 


He 


Ser 


Ser 


Ala 


Glu 


Ala 


Ala 


Lys 


Glu 


Val 






85 










90 






Asp 


Ala 


Phe 


Ala 


Gin 


Arg 


Pro 


He 


Val 


Leu 


Asp 




100 










105 








Xyr 


Asn 


Arg 


Lys 


Asp 


Val 


Leu 


Phe 


Ala 


Ser 


Tyr 


115 










120 










Gin 


Met 


Lys 


Lys 


He 


Trp 


He 


Leu 


Glu 


Phe 


Leu 


130 








135 










140 


Gin 


Ser 


Ser 


Arg 


Leu 


He 


Arg 


Glu 


Glu 


Glu 


Met 










150 










155 




Phe 


Leu 


Arg 


Ser 


Lys 


Ala 


Gly 


Ser 


Pro 


Val 


Asn 






165 










170 






Tyr 


Gly 


He 


He 


He 


Ser 


He 


Met 


He 


Arg 


Thr 


180 










185 









82 100 -^"3 

Phe Tyr Asn Arg Lys Asp Val Leu Phe Ala Ser Tyr 

85 
87 

val Gin ser Ser Arg Leu He Arg Glu Glu Glu Met ^lu tK^y nxa 
91 145 150 155 160 

Phe Leu Arg Ser Lys Ala Gly Ser Pro Val Asn He Thr 
94 165 170 175 

He Tyr Gly He He He Ser He Met He Arg Thr Ser 
97 180 ^""^ 

99 Cys Lys Gin Lys Glu Arg Leu Leu £ 

102 Ala Ala Thr Ser Phe Gly Thr Ala Asp Ala Phe Pro Thr Trp Lys Leu 

103 210 215 220 

105 Leu His Tyr He He Gly Ala Glu Ser Lys Pro Arg Arg Leu His Gin 

106 225 230 235 240 

108 Glu He Asp Asp He Leu Glu Glu He Leu Asn Glu His Lys Ala Asn 

109 245 250 255 

111 LYS Pro Phe Glu Ala Asp Asn Leu Met Asp Val Leu Leu Asn Leu Gin 

112 260 265 270 

114 Lys Asn Gly Asn Val Pro Val Pro Val Thr Asn Glu Ser He Lys Ala 

115 275 280 285 

117 ser val Leu Gin Met Phe Thr Ala Gly Ser Glu Thr Thr Ser Lys Ala 

118 290 295 



124 
126 
127 
129 
130 



Thr 


Glu 


Trp 


Val 


Met 


Ala 


Glu 


Leu 


Met 


Lys 


305 










310 










Lys 


Ala 


Gin 


Glu 


Glu 


Val 


Arg 


Gin 


Val 


Phe 








325 










330 


Asp 


Glu 


Ser 


Arg 


Phe 


His 


Asp 


Leu 


Lys 


Phe 






340 










345 




Glu 


Thr 


Leu 


Arg 


Leu 


His 


Pro 


Pro 


Val 


Val 






355 










360 






Arg 


Glu 


Thr 


Thr 


Arg 


He 


Asp 


Gly 


Tyr 


Glu 





205 






Pro 


Thr 


Trp 


Lys 


220 








Arg 


Arg 


Leu 


His 


Glu 


His 


Lys 


Ala 








255 


Leu 


Leu 


Asn 


Leu 






270 




Glu 


Ser 


He 


Lys 




285 






Thr 


Thr 


Ser 


Lys 


300 








Pro 


Thr 


Glu 


Leu 


Glu 


Met 


Gly 


Lys 








335 


Lys 


Leu 


Val 


Val 






350 




He 


Pro 


Arg 


Glu 




365 






His 


Pro 


Asn 


Thr 



315 320 
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RAW SEQUENCE LISTING DATE: 08/02/2001 

PATENT APPLICATION: US/09/909,566 TIME: 11:03:33 

Input Set : A:\BB1465 US NA Seq Listing.txt 
Output set: N:\CRF3\08022001\I909566.raw 

133 370 375 380 

135 He Val Val Asn Ala Trp Ala He Gly Arg Asp Pro Asn Thr Trp Ser 

136 385 390 395 400 

138 Glu Pro Gly Lys Phe Asn Pro Glu Arg Phe Lys Asp Cys Ala He Asp 

139 405 410 415 

141 Tyr Lys Gly Thr Thr Phe Glu Leu Val Pro Phe Gly Ala Gly Lys Arg 

142 420 425 430 

144 He Cys Pro Gly He Thr Ser Ala He Thr Asn Leu Glu Tyr Val He 

145 435 440 445 

147 He Asn Leu Leu Tyr His Phe Asn Trp Glu Leu Ala Asp Gly He Thr 

148 450 455 460 

150 Pro Gin Thr Leu Asp Met Thr Glu Ala He Gly Gly Ala Leu Arg Lys 

151 465 470 475 480 

153 Lys He Asp Leu Lys Leu He Pro He Pro Tyr Gin Val Ser Leu Gly 

154 485 490 495 

156 Ser Asn He Ser 

157 500 

160 <210> SEQ ID NO: 3 

161 <211> LENGTH: 502 

162 <212> TYPE: PRT 

163 <213> ORGANISM: Capsicum annuum 

165 <400> SEQUENCE: 3 

166 Met Glu He Gin Phe Thr Asn Leu Val Ala Phe Leu Leu Phe Leu Ser 
167 



10 15 



169 Ser He He Leu Leu Leu Lys Lys Trp Lys Thr Gin Lys Leu Asn Leu 

170 20 25 30 

172 Pro Pro Gly Pro Trp Lys Leu Pro Phe He Gly Ser Leu His His Leu 

173 35 40 45 

175 Ala Val Ala Gly Pro Leu Pro His His Gly Leu Lys Asn Leu Ala Lys 

176 50 55 ^° u 1 

178 Leu Tyr Gly Pro Leu Met His Leu Arg Leu Gly Glu He Pro Thr Val 

179 65 70 . . 

181 He He Ser Ser Pro Arg Met Ala Lys Glu Val Leu Lys Thr His Asp 

182 85 90 95 

184 Leu Ala Phe Ala Thr Arg Pro Lys Leu Val Val Ala Asp He Val His 

185 100 105 110 

187 Tyr Asp Ser Thr Asp He Ala Phe Ser Pro Tyr Gly Glu Tyr Trp Arg 

188 115 120 125 

190 Gin He Arg Lys He Cys He Leu Glu Leu Leu Ser Ala Lys Met Val 

191 130 135 140 • 

193 Lys Phe Phe Ser Ser He Arg Gin Asp Glu Leu Ser Met Met Val Ser 

194 145 150 155 160 

196 Ser He Arg Thr Met Pro Asn Phe Pro Val Asn Leu Thr Asp Lys He 

197 165 170 175 

199 Phe Trp Phe Thr Ser Ser Val Thr Cys Arg Ser Ala Leu Gly Lys He 

200 180 185 190 

202 Cys Arg Asp Gin Asp Lys Leu He He Phe Met Arg Glu He He Ser 

203 195 200 205 

205 Leu Thr Gly Gly Phe Ser He Ala Asp Phe Phe Pro Thr Trp Lys Met 
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RAW SEQUENCE LISTING DATE: 08/02/2001 

PATENT APPLICATION: US/09/909,566 TIME: 11:03:33 

Input Set : A:\BB1465 US NA Seq Listing.txt 
Output Set: N:\CRF3\08022001\I909566.raw 

206 210 215 220 

208 Leu His Asp Val Gly Gly Ser Lys Thr Arg Leu Leu Lys Ala His Arg 

209 225 230 235 240 

211 Lys He Asp Glu He Leu Glu His Val Val Asn Glu His Lys Gin Asn 

212 245 250 255 

214 Arg Ala Asp Gly Gin Lys Gly Asn Gly Glu Phe Gly Gly Glu Asp Leu 

215 260 265 270 

217 He Asp Val Leu Leu Arg Val Arg Glu Ser Gly Glu Val Gin He Ser 

218 275 280 285 

220 He Thr Asp Asp Asn He Lys Ser He Leu Val Asp Met Phe Ser Ala 

221 290 295 300 

223 Gly Ser Glu Thr Ser Ser Thr Thr He He Trp Ala Leu Ala Glu Met 

224 305 310 315 320 

226 Met Lys Lys Pro Ser Val Leu Ala Lys Ala Gin Ala Glu Val Arg Gin 

227 325 330 335 

22 9 Val Leu Lys Glu Lys Lys Gly Phe Gin Gin He Asp Leu Asp Glu Leu 
230 340 345 350 

232 Lys Tyr Leu Lys Leu Val He Lys Glu Thr Leu Arg Met His Pro Pro 

233 355 360 365 

235 He Pro Leu Leu Val Pro Arg Glu Cys Met Lys Asp Thr Lys He Asp 

236 370 375 380 

238 Gly Tyr Asn He Pro Phe Lys Thr Arg Val He Val Asn Ala Trp Ala 

239 385 390 395 400 

241 He Gly Arg Asp Pro Glu Ser Trp Asp Asp Pro Glu Ser Phe Ser Pro 

242 405 410 415 

244 Glu Arg Phe Glu Asn Ser Ser Val Asp Phe Leu Gly Ser His His Gin 

245 420 425 430 

247 Phe He Pro Phe Gly Ala Gly Arg Arg He Cys Pro Gly Met Leu Phe 

248 435 440 445 

250 Gly Leu Ala Asn Val Gly Gin Pro Leu Ala Gin Leu Leu Tyr His Phe 

251 450 455 460 

253 Asp Arg Lys Leu Pro Asn Gly Gin Ser His Glu Asn Leu Asp Met Thr 

254 465 470 475 480 

256 Glu Ser Pro Gly He Ser Ala Thr Arg Lys Asp Asp Leu Val Leu He 

257 485 490 495 

259 Ala Thr Pro Tyr Asp Pro 

260 500 - 1 AJ jh 

263 <210> SEQ ID NO: 4 /O ^ W A<^>^m^^ ^^^^ 

264 <211> LENGTH: 51 /^lyrg ^ /OM^^ ^ 

265 <212> TYPE: YMAK ^J^^^-^ " 



266 <213> ORGANISM :( ^ynthetic co nstruct 

268 <400> SEQUENCE: 4 

269 tcaaggagaa aaaaccccgg atccatggag cagaaaaatc tctcttttcc g 51 

272 <210> SEQ ID NO: 5 

273 <211> LENGTH: 35 _____ . .^.^^ 

274 <212> TYPE: DNA 





275 <213> ORGANISM (^synthetic construct 



277 <400> SEQUENCE: 

278 ggccagtgaa ttgtaatacg actcactata gggcg 35 
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RAW SEQUENCE LISTING DATE: 08/02/2001 

PATENT APPLICATION: US/09/909,566 TIME: 11:03:33 

Input Set : A:\BB14 65 US NA Seq Listing .txt 
Output Set: N:\CRF3\08022001\I909566.raw 



281 <210> SEQ ID NO: 6 

282 <211> LENGTH: 35 

283 <212> TYPE: DNA 




284 <213> ORGANISM^ synthetic construct^ 

286 <400> SEQUENCE: 

287 gcggccgcga attcggaaaa tggagcagaa aaatc 35 

290 <210> SEQ ID NO: 7 

291 <211> LENGTH: 35 

292 <212> TYPE: DNA^ ^ ^ 

293 <213> ORGANISM: Qimthetic construct^' 

295 <4 00> SEQUENCE: 7 

296 gcggccgcgg atccttagaa catcgttaat taaag 35 
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VERIFICATION SUMMARY DATE: 08/02/2001 

PATENT APPLICATION: US/09/909,566 TIME: 11:03:34 

Input Set : A:\BB14 65 US NA Seq Listing. txt 
Output Set: N:\CRF3\08022001\I909566.raw 

L:10 M:270 C: Current Application Number differs. Replaced Application Number 

L-ll M-271 C: Current Filing Date differs, Replaced Current Filing Date 

L-14 M-256 W: Invalid Numeric Header Field, Wrong Prior FILING DATE: YYYY-MM-DD 
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